GESTALT: Genomic Steiner Alignments
نویسندگان
چکیده
منابع مشابه
SALSA: Sequence ALignment via Steiner Ancestors
We describe SALSA (Sequence ALignment via Steiner Ancestors), a public{domain suite of programs for generating multiple alignments of a set of genomic sequences. We allow the use of either of the two popular objectives, Tree Alignment or Sum-of-Pairs. The main distinguishing feature of our method is that the alignment is obtained via a tree in which the internal nodes (ancestors) are labeled by...
متن کاملAdaptive BLASTing through the Sequence Dataspace: Theories on Protein Sequence Embedding
A major computational challenge in the genomic era is annotating structure/function to the vast quantities of sequence information now available. This problem is illustrated by the fact that most proteins lack comprehensive annotation, even when experimental evidence exists. We theorized that phylogenetic profiles provide a quantitative method that can relate the structural and functional prope...
متن کاملA Grouping Principle and Four Applications
Wertheimer’s theory suggests a general perception law according to which objects having a quality in common get perceptually grouped. The Helmholtz principle is a quantitative version of this general grouping law. It states that a grouping is perceptually “meaningful” if its number of occurrences would be very small in a random situation: Geometric structures are then characterized as large dev...
متن کاملScoring Pairwise Genomic Sequence Alignments
The parameters by which alignments are scored can strongly affect sensitivity and specificity of alignment procedures. While appropriate parameter choices are well understood for protein alignments, much less is known for genomic DNA sequences. We describe a straightforward approach to scoring nucleotide substitutions in genomic sequence alignments, especially human-mouse comparisons. Scores ar...
متن کاملGeneralized Suffix Trees for Biological Sequence Data: Applications and Implementation
This paper addresses applications of sujjix trees and generalized suffix trees (GSTs) to biological sequence data analysis. We define a basic set of suffix tree and GST operations needed to support sequence data analysis. While those &finitions are straightforward, the construction and manipulation of disk-based GST structures for large volumes of sequence data requires intricate design. GST pr...
متن کامل